Overview

Dataset statistics

Number of variables25
Number of observations560844
Missing cells2902243
Missing cells (%)20.7%
Duplicate rows3864
Duplicate rows (%)0.7%
Total size in memory103.2 MiB
Average record size in memory193.0 B

Variable types

CAT18
DATE3
NUM3
BOOL1

Reproduction

Analysis started2021-01-30 07:21:11.539317
Analysis finished2021-01-30 07:22:05.863572
Duration54.32 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Dataset has 3864 (0.7%) duplicate rows Duplicates
CLOSE-DT has a high cardinality: 4684 distinct values High cardinality
CREDIT-LIMIT/SANC AMT has a high cardinality: 877 distinct values High cardinality
DISBURSED-AMT/HIGH CREDIT has a high cardinality: 72426 distinct values High cardinality
INSTALLMENT-AMT has a high cardinality: 50133 distinct values High cardinality
CURRENT-BAL has a high cardinality: 147445 distinct values High cardinality
OVERDUE-AMT has a high cardinality: 22385 distinct values High cardinality
REPORTED DATE - HIST has a high cardinality: 57846 distinct values High cardinality
DPD - HIST has a high cardinality: 134338 distinct values High cardinality
CUR BAL - HIST has a high cardinality: 447072 distinct values High cardinality
AMT OVERDUE - HIST has a high cardinality: 187341 distinct values High cardinality
AMT PAID - HIST has a high cardinality: 83417 distinct values High cardinality
DISBURSED-DT has 32150 (5.7%) missing values Missing
CLOSE-DT has 251758 (44.9%) missing values Missing
LAST-PAYMENT-DATE has 319283 (56.9%) missing values Missing
CREDIT-LIMIT/SANC AMT has 545685 (97.3%) missing values Missing
INSTALLMENT-AMT has 420509 (75.0%) missing values Missing
INSTALLMENT-FREQUENCY has 425135 (75.8%) missing values Missing
OVERDUE-AMT has 118891 (21.2%) missing values Missing
WRITE-OFF-AMT has 19123 (3.4%) missing values Missing
ASSET_CLASS has 300376 (53.6%) missing values Missing
REPORTED DATE - HIST has 19123 (3.4%) missing values Missing
DPD - HIST has 19647 (3.5%) missing values Missing
CUR BAL - HIST has 19123 (3.4%) missing values Missing
AMT OVERDUE - HIST has 19123 (3.4%) missing values Missing
AMT PAID - HIST has 20294 (3.6%) missing values Missing
TENURE has 368107 (65.6%) missing values Missing
WRITE-OFF-AMT is highly skewed (γ1 = 283.4678315) Skewed
WRITE-OFF-AMT has 540655 (96.4%) zeros Zeros
TENURE has 18051 (3.2%) zeros Zeros

Variables

ID
Real number (ℝ≥0)

Distinct count128655
Unique (%)22.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67401.69835997175
Minimum1
Maximum143395
Zeros0
Zeros (%)0.0%
Memory size4.3 MiB

Quantile statistics

Minimum1
5-th percentile6329
Q132434
median61399
Q3101283.25
95-th percentile135844
Maximum143395
Range143394
Interquartile range (IQR)68849.25

Descriptive statistics

Standard deviation41016.1773
Coefficient of variation (CV)0.6085332906
Kurtosis-1.146697158
Mean67401.69836
Median Absolute Deviation (MAD)33280
Skewness0.2017691519
Sum3.780183812e+10
Variance1682326800
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
710604200.1%
 
141732165< 0.1%
 
51786152< 0.1%
 
1167138< 0.1%
 
97794124< 0.1%
 
128175120< 0.1%
 
28072112< 0.1%
 
126242105< 0.1%
 
54465100< 0.1%
 
50398< 0.1%
 
Other values (128645)55931099.7%
 
ValueCountFrequency (%) 
19< 0.1%
 
213< 0.1%
 
331< 0.1%
 
74< 0.1%
 
87< 0.1%
 
ValueCountFrequency (%) 
1433951< 0.1%
 
1433941< 0.1%
 
1433934< 0.1%
 
1433911< 0.1%
 
1433902< 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size547.7 KiB
False
396523
True
164321
ValueCountFrequency (%) 
False39652370.7%
 
True16432129.3%
 

MATCH-TYPE
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
PRIMARY
560647
SECONDARY
 
197
ValueCountFrequency (%) 
PRIMARY560647> 99.9%
 
SECONDARY197< 0.1%
 

Length

Max length9
Median length7
Mean length7.000702513
Min length7

ACCT-TYPE
Categorical

Distinct count50
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
Tractor Loan
186242
Gold Loan
91024
Business Loan Priority Sector Agriculture
80084
Kisan Credit Card
 
33369
Auto Loan (Personal)
 
29575
Other values (45)
140550
ValueCountFrequency (%) 
Tractor Loan18624233.2%
 
Gold Loan9102416.2%
 
Business Loan Priority Sector Agriculture8008414.3%
 
Kisan Credit Card333695.9%
 
Auto Loan (Personal)295755.3%
 
Other272264.9%
 
Commercial Vehicle Loan195223.5%
 
Two-Wheeler Loan158052.8%
 
Credit Card127022.3%
 
Consumer Loan121242.2%
 
Other values (40)531719.5%
 

Length

Max length67
Median length12
Mean length17.33400019
Min length5

CONTRIBUTOR-TYPE
Categorical

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
NBF
256835
NAB
173895
PRB
75353
RRB
 
24023
COP
 
22201
Other values (7)
 
8537
ValueCountFrequency (%) 
NBF25683545.8%
 
NAB17389531.0%
 
PRB7535313.4%
 
RRB240234.3%
 
COP222014.0%
 
MFI33700.6%
 
HFC27440.5%
 
CCC14700.3%
 
FRB7550.1%
 
SFB187< 0.1%
 
Other values (2)11< 0.1%
 

Length

Max length3
Median length3
Mean length3
Min length3
Distinct count2652
Unique (%)0.5%
Missing3683
Missing (%)0.7%
Memory size4.3 MiB
Minimum2008-03-01 00:00:00
Maximum2020-08-08 00:00:00
Histogram

OWNERSHIP-IND
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
Individual
483844
Joint
 
37425
Guarantor
 
34876
Primary
 
4561
Supl Card Holder
 
138
ValueCountFrequency (%) 
Individual48384486.3%
 
Joint374256.7%
 
Guarantor348766.2%
 
Primary45610.8%
 
Supl Card Holder138< 0.1%
 

Length

Max length16
Median length10
Mean length9.581245409
Min length5

ACCOUNT-STATUS
Categorical

Distinct count11
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
Closed
320255
Active
201897
Delinquent
 
32457
Written Off
 
2937
Suit Filed
 
2062
Other values (6)
 
1236
ValueCountFrequency (%) 
Closed32025557.1%
 
Active20189736.0%
 
Delinquent324575.8%
 
Written Off29370.5%
 
Suit Filed20620.4%
 
Settled6260.1%
 
Restructured5110.1%
 
SUIT FILED (WILFUL DEFAULT)70< 0.1%
 
WILFUL DEFAULT27< 0.1%
 
Sold/Purchased1< 0.1%
 

Length

Max length27
Median length6
Mean length6.281985722
Min length6

DISBURSED-DT
Date

MISSING

Distinct count6279
Unique (%)1.2%
Missing32150
Missing (%)5.7%
Memory size4.3 MiB
Minimum1900-01-02 00:00:00
Maximum2020-02-11 00:00:00
Histogram

CLOSE-DT
Categorical

HIGH CARDINALITY
MISSING

Distinct count4684
Unique (%)1.5%
Missing251758
Missing (%)44.9%
Memory size4.3 MiB
2019-06-01 00:00:00
 
701
2017-07-31 00:00:00
 
687
2018-03-31 00:00:00
 
656
2018-01-31 00:00:00
 
623
2017-10-31 00:00:00
 
578
Other values (4679)
305841
ValueCountFrequency (%) 
2019-06-01 00:00:007010.1%
 
2017-07-31 00:00:006870.1%
 
2018-03-31 00:00:006560.1%
 
2018-01-31 00:00:006230.1%
 
2017-10-31 00:00:005780.1%
 
2015-07-01 00:00:005750.1%
 
2017-06-30 00:00:005580.1%
 
2016-06-30 00:00:005340.1%
 
2016-03-31 00:00:005310.1%
 
2017-03-31 00:00:004960.1%
 
Other values (4674)30314754.1%
 
(Missing)25175844.9%
 

Length

Max length19
Median length19
Mean length11.81773898
Min length3
Distinct count4085
Unique (%)1.7%
Missing319283
Missing (%)56.9%
Memory size4.3 MiB
Minimum1900-01-02 00:00:00
Maximum2099-12-12 00:00:00
Histogram

CREDIT-LIMIT/SANC AMT
Categorical

HIGH CARDINALITY
MISSING

Distinct count877
Unique (%)5.8%
Missing545685
Missing (%)97.3%
Memory size4.3 MiB
0
4624
30,000
 
917
25,000
 
881
15,000
 
706
10,000
 
572
Other values (872)
7459
ValueCountFrequency (%) 
046240.8%
 
30,0009170.2%
 
25,0008810.2%
 
15,0007060.1%
 
10,0005720.1%
 
50,0005490.1%
 
20,0005390.1%
 
40,0003460.1%
 
60,000214< 0.1%
 
1,00,000204< 0.1%
 
Other values (867)56071.0%
 
(Missing)54568597.3%
 

Length

Max length9
Median length3
Mean length3.044079994
Min length1

DISBURSED-AMT/HIGH CREDIT
Categorical

HIGH CARDINALITY

Distinct count72426
Unique (%)12.9%
Missing0
Missing (%)0.0%
Memory size4.3 MiB
0
 
32049
3,00,000
 
25150
2,00,000
 
15784
2,50,000
 
13091
4,00,000
 
13059
Other values (72421)
461711
ValueCountFrequency (%) 
0320495.7%
 
3,00,000251504.5%
 
2,00,000157842.8%
 
2,50,000130912.3%
 
4,00,000130592.3%
 
1,00,000128172.3%
 
3,50,000118722.1%
 
50,00092911.7%
 
1,50,00081481.5%
 
5,00,00072841.3%
 
Other values (72416)41229973.5%
 

Length

Max length12
Median length8
Mean length6.995494291
Min length1

INSTALLMENT-AMT
Categorical

HIGH CARDINALITY
MISSING

Distinct count50133
Unique (%)35.7%
Missing420509
Missing (%)75.0%
Memory size4.3 MiB
0/Monthly
28606
0
 
1934
1,00,000/Monthly
 
1735
50,000/Monthly
 
793
40,000/Monthly
 
623
Other values (50128)
106644
ValueCountFrequency (%) 
0/Monthly286065.1%
 
019340.3%
 
1,00,000/Monthly17350.3%
 
50,000/Monthly7930.1%
 
40,000/Monthly6230.1%
 
60,000/Monthly5070.1%
 
30,000/Monthly4960.1%
 
10,000/Monthly4930.1%
 
1,0,000/Monthly4750.1%
 
20,000/Monthly4520.1%
 
Other values (50123)10422118.6%
 
(Missing)42050975.0%
 

Length

Max length46
Median length3
Mean length5.452630321
Min length1

CURRENT-BAL
Categorical

HIGH CARDINALITY

Distinct count147445
Unique (%)26.3%
Missing233
Missing (%)< 0.1%
Memory size4.3 MiB
0
353347
2,50,000
 
760
1,00,000
 
427
3,00,000
 
389
50,000
 
356
Other values (147440)
205332
ValueCountFrequency (%) 
035334763.0%
 
2,50,0007600.1%
 
1,00,0004270.1%
 
3,00,0003890.1%
 
50,0003560.1%
 
1,50,0003180.1%
 
2,00,0002910.1%
 
3,50,000269< 0.1%
 
60,000207< 0.1%
 
40,000203< 0.1%
 
Other values (147435)20404436.4%
 
(Missing)233< 0.1%
 

Length

Max length12
Median length1
Mean length3.268782407
Min length1

INSTALLMENT-FREQUENCY
Categorical

MISSING

Distinct count9
Unique (%)< 0.1%
Missing425135
Missing (%)75.8%
Memory size4.3 MiB
F03
128888
F01
 
2377
F05
 
2334
F02
 
1778
F10
 
305
Other values (4)
 
27
ValueCountFrequency (%) 
F0312888823.0%
 
F0123770.4%
 
F0523340.4%
 
F0217780.3%
 
F103050.1%
 
F0413< 0.1%
 
F077< 0.1%
 
F086< 0.1%
 
F061< 0.1%
 
(Missing)42513575.8%
 

Length

Max length3
Median length3
Mean length3
Min length3

OVERDUE-AMT
Categorical

HIGH CARDINALITY
MISSING

Distinct count22385
Unique (%)5.1%
Missing118891
Missing (%)21.2%
Memory size4.3 MiB
0
408433
118
 
271
1
 
145
100
 
89
236
 
61
Other values (22380)
 
32954
ValueCountFrequency (%) 
040843372.8%
 
118271< 0.1%
 
1145< 0.1%
 
10089< 0.1%
 
23661< 0.1%
 
11953< 0.1%
 
534< 0.1%
 
232< 0.1%
 
50031< 0.1%
 
10,00030< 0.1%
 
Other values (22375)327745.8%
 
(Missing)11889121.2%
 

Length

Max length11
Median length1
Mean length1.697939177
Min length1

WRITE-OFF-AMT
Real number (ℝ)

MISSING
SKEWED
ZEROS

Distinct count1009
Unique (%)0.2%
Missing19123
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean295.631911998981
Minimum-30.0
Maximum8797356.0
Zeros540655
Zeros (%)96.4%
Memory size4.3 MiB

Quantile statistics

Minimum-30
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum8797356
Range8797386
Interquartile range (IQR)0

Descriptive statistics

Standard deviation19501.65161
Coefficient of variation (CV)65.96598952
Kurtosis110574.922
Mean295.631912
Median Absolute Deviation (MAD)0
Skewness283.4678315
Sum160150015
Variance380314415.4
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
054065596.4%
 
3528869< 0.1%
 
1436144< 0.1%
 
230563< 0.1%
 
536823< 0.1%
 
2383< 0.1%
 
1843102< 0.1%
 
852< 0.1%
 
75742< 0.1%
 
1422902< 0.1%
 
Other values (999)10360.2%
 
(Missing)191233.4%
 
ValueCountFrequency (%) 
-301< 0.1%
 
054065596.4%
 
12< 0.1%
 
41< 0.1%
 
71< 0.1%
 
ValueCountFrequency (%) 
87973561< 0.1%
 
70000001< 0.1%
 
36163121< 0.1%
 
24000001< 0.1%
 
20953111< 0.1%
 

ASSET_CLASS
Categorical

MISSING

Distinct count8
Unique (%)< 0.1%
Missing300376
Missing (%)53.6%
Memory size4.3 MiB
Standard
248670
SubStandard
 
5300
Doubtful
 
2578
Special Mention Account
 
2505
Loss
 
873
Other values (3)
 
542
ValueCountFrequency (%) 
Standard24867044.3%
 
SubStandard53000.9%
 
Doubtful25780.5%
 
Special Mention Account25050.4%
 
Loss8730.2%
 
13620.1%
 
01177< 0.1%
 
23< 0.1%
 
(Missing)30037653.6%
 

Length

Max length23
Median length3
Mean length5.404779226
Min length1

REPORTED DATE - HIST
Categorical

HIGH CARDINALITY
MISSING

Distinct count57846
Unique (%)10.7%
Missing19123
Missing (%)3.4%
Memory size4.3 MiB
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,
 
25369
20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,20170131,
 
6699
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,
 
3480
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,
 
2923
20200131,
 
2881
Other values (57841)
500369
ValueCountFrequency (%) 
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,253694.5%
 
20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,20170131,66991.2%
 
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,34800.6%
 
20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,29230.5%
 
20200131,28810.5%
 
20200131,20191231,20191130,28360.5%
 
20200131,20191231,26890.5%
 
20190831,20190820,25140.4%
 
20200131,20191231,20191130,20191031,23140.4%
 
20200131,20191231,20191130,20191031,20190930,22930.4%
 
Other values (57836)48772387.0%
 
(Missing)191233.4%
 

Length

Max length324
Median length162
Mean length174.0041455
Min length3

DPD - HIST
Categorical

HIGH CARDINALITY
MISSING

Distinct count134338
Unique (%)24.8%
Missing19647
Missing (%)3.5%
Memory size4.3 MiB
0
 
43756
XXX
 
39096
000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000
 
22330
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
10778
000
 
7369
Other values (134333)
417868
ValueCountFrequency (%) 
0437567.8%
 
XXX390967.0%
 
000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000223304.0%
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX107781.9%
 
00073691.3%
 
XXXXXX72191.3%
 
000DDD00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000059801.1%
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX55401.0%
 
00000047660.8%
 
00000000000043680.8%
 
Other values (134328)38999569.5%
 
(Missing)196473.5%
 

Length

Max length108
Median length45
Mean length52.22108643
Min length1

CUR BAL - HIST
Categorical

HIGH CARDINALITY
MISSING

Distinct count447072
Unique (%)82.5%
Missing19123
Missing (%)3.4%
Memory size4.3 MiB
,
 
31827
0,
 
9182
0,0,
 
2428
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
 
2039
0,0,,,,,,,,,,,,,0,0,
 
518
Other values (447067)
495727
ValueCountFrequency (%) 
,318275.7%
 
0,91821.6%
 
0,0,24280.4%
 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,20390.4%
 
0,0,,,,,,,,,,,,,0,0,5180.1%
 
0,0,0,4220.1%
 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3150.1%
 
250000,250000,250000,275< 0.1%
 
0,0,0,,,,,,,,,,,,,0,247< 0.1%
 
0,0,0,0,0,0,0,222< 0.1%
 
Other values (447062)49424688.1%
 
(Missing)191233.4%
 

Length

Max length357
Median length100
Mean length115.1796168
Min length1

AMT OVERDUE - HIST
Categorical

HIGH CARDINALITY
MISSING

Distinct count187341
Unique (%)34.6%
Missing19123
Missing (%)3.4%
Memory size4.3 MiB
,
 
35378
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
 
14323
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
 
11116
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
 
10319
0,
 
9245
Other values (187336)
461340
ValueCountFrequency (%) 
,353786.3%
 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0143232.6%
 
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,111162.0%
 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,103191.8%
 
0,92451.6%
 
0,0,79771.4%
 
0,,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,059931.1%
 
056441.0%
 
,,,,,,,,,,,,,51300.9%
 
0,0,0,0,0,0,0,0,0,0,0,0,046260.8%
 
Other values (187331)43197077.0%
 
(Missing)191233.4%
 

Length

Max length291
Median length36
Mean length43.03493841
Min length1

AMT PAID - HIST
Categorical

HIGH CARDINALITY
MISSING

Distinct count83417
Unique (%)15.4%
Missing20294
Missing (%)3.6%
Memory size4.3 MiB
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
 
63203
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
 
44600
,
 
41773
,,,,,,,,,,,,,,
 
13911
,,
 
11439
Other values (83412)
365624
ValueCountFrequency (%) 
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,6320311.3%
 
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,446008.0%
 
,417737.4%
 
,,,,,,,,,,,,,,139112.5%
 
,,114392.0%
 
,,,,,,,,,,,,,105261.9%
 
,,,,,,,,,,,,,,,103181.8%
 
,,,96921.7%
 
,,,,,,,,,,,,88241.6%
 
,,,,,,88031.6%
 
Other values (83407)31746156.6%
 
(Missing)202943.6%
 

Length

Max length308
Median length23
Mean length27.71264558
Min length1

TENURE
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count324
Unique (%)0.2%
Missing368107
Missing (%)65.6%
Infinite0
Infinite (%)0.0%
Mean28.801029382007606
Minimum0.0
Maximum856.0
Zeros18051
Zeros (%)3.2%
Memory size4.3 MiB

Quantile statistics

Minimum0
5-th percentile0
Q112
median23
Q336
95-th percentile72
Maximum856
Range856
Interquartile range (IQR)24

Descriptive statistics

Standard deviation32.17097453
Coefficient of variation (CV)1.117007802
Kurtosis33.20228367
Mean28.80102938
Median Absolute Deviation (MAD)13
Skewness4.098092905
Sum5551024
Variance1034.971602
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12478918.5%
 
36271024.8%
 
0180513.2%
 
60138442.5%
 
24116322.1%
 
4871351.3%
 
150880.9%
 
3050470.9%
 
642250.8%
 
1833840.6%
 
Other values (314)493388.8%
 
(Missing)36810765.6%
 
ValueCountFrequency (%) 
0180513.2%
 
150880.9%
 
23940.1%
 
35500.1%
 
416290.3%
 
ValueCountFrequency (%) 
8562< 0.1%
 
7521< 0.1%
 
7452< 0.1%
 
7041< 0.1%
 
6701< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

IDSELF-INDICATORMATCH-TYPEACCT-TYPECONTRIBUTOR-TYPEDATE-REPORTEDOWNERSHIP-INDACCOUNT-STATUSDISBURSED-DTCLOSE-DTLAST-PAYMENT-DATECREDIT-LIMIT/SANC AMTDISBURSED-AMT/HIGH CREDITINSTALLMENT-AMTCURRENT-BALINSTALLMENT-FREQUENCYOVERDUE-AMTWRITE-OFF-AMTASSET_CLASSREPORTED DATE - HISTDPD - HISTCUR BAL - HISTAMT OVERDUE - HISTAMT PAID - HISTTENURE
01FalsePRIMARYOverdraftNAB2018-04-30IndividualDelinquent2015-10-05NaN2018-02-27NaN37,352NaN37,873NaN37,8730.0Standard20180430,20180331,03000037873,12820,37873,,,,NaN
11FalsePRIMARYAuto Loan (Personal)NAB2019-12-31IndividualActive2018-03-19NaN2019-12-19NaN44,0001,405/Monthly20,797F03NaN0.0Standard20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,20180831,20180731,20180630,20180531,20180430,20180331,00000000000000000000000000000000000000000000000000000000000001100020797,21988,23174,24341,25504,26648,27780,28910,30020,31128,32267,33427,34547,35655,36764,37850,38939,40004,41059,42118,44601,44181,,,,,,,,,,,,,,,,,,,,,1452,,,,,,,,,,,,,,,,,,,,,,,,36.0
21TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2019-08-30NaNNaTNaN1,45,000NaN1,16,087NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,000000000000000000116087,116087,145000,145000,145000,145000,0,0,0,0,0,0,,,,,,,NaN
31TruePRIMARYAuto Loan (Personal)NBF2017-09-30IndividualClosed2013-09-272017-09-21 00:00:00NaTNaN3,00,000NaN0NaN00.0NaN20170930,20170801,20170731,20170630,20170531,20170430,20170331,20170228,20170131,20161231,20161130,20161031,20160930,20160831,20160731,20160630,20160531,20160430,20160331,20160229,20160131,20151231,20151130,20151031,20150930,20150831,20150731,20150630,20150531,20150430,20150331,20150228,20150131,20141231,20141130,20141031,000DDD0270260270260270240270270000320000000000000000260270250270270260270260270270260270000000000000000000000,,15925,23754,31494,39147,46713,54194,61590,68903,68903,76133,83281,90348,97336,104245,111076,124506,131108,137635,144088,150468,156776,163013,169180,175277,181305,187265,193157,198983,204743,210438,216069,221636,227140,232582,0,,1014,1014,1014,1014,1014,1014,1014,983,0,927,0,0,0,0,0,778,754,734,712,691,671,651,633,615,597,580,565,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,NaN
41TruePRIMARYTractor LoanNBF2016-02-29IndividualClosed2012-02-102016-02-01 00:00:00NaTNaN2,75,000NaN0NaN00.0NaN20160229,20160131,20151231,20151130,20151031,20150930,20150831,20150731,20150630,20150531,20150430,20150331,20150228,20150131,20141231,20141130,20141031,20140930,20140831,20140731,20140630,20140531,20140430,20140331,20140228,20140131,20131231,20131130,20131031,20130930,20130831,20130731,20130630,20130531,20130430,20130331,0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000,0,23658,23321,22989,46321,45662,45012,68030,67062,66108,88826,87562,86316,108747,107200,105675,127830,126012,124219,146111,144033,141984,163623,161295,159001,180398,177832,175302,196467,193672,190917,211860,208846,205875,226605,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,NaN
51FalsePRIMARYCredit CardNAB2018-04-30IndividualClosed2018-01-112018-03-13 00:00:00NaT50,0000NaN0NaNNaN0.0Standard20180331,20180228,20180131,00000000024650,17300,0,,,,,,,,NaN
61FalsePRIMARYAuto Loan (Personal)NAB2019-12-31IndividualActive2018-11-15NaN2019-12-15NaN5,00,0007,934/Monthly4,43,769F03NaN0.0Standard20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,000000000000000000000000000000000000000000443769,448365,453134,457687,462320,466802,471249,475779,480157,484620,488928,493570,497810,502016,,,,,,,,,,,,,,,,,,,,,,,,,,,,,84.0
71TruePRIMARYAuto Loan (Personal)NBF2017-09-30IndividualClosed2013-01-302017-09-21 00:00:00NaTNaN5,00,000NaN0NaN00.0NaN20170930,20170801,20170731,20170630,20170531,20170430,20170331,20170228,20170131,20161231,20161130,20161031,20160930,20160831,20160731,20160630,20160531,20160430,20160331,20160229,20160131,20151231,20151130,20151031,20150930,20150831,20150731,20150630,20150531,20150430,20150331,20150228,20150131,20141231,20141130,20141031,000DDD0000000000000870540270270000320000000000000000260270250270270260270260270270260270260270240270270260270,,0,0,0,0,0,0,0,13060,13060,25982,38767,51417,63933,76317,88570,112688,124556,136299,147917,159413,170787,182041,193176,204193,215093,225878,236549,247107,257553,267889,278115,288233,298244,308149,0,,0,0,0,0,3064,3064,3064,2972,0,2802,0,0,0,0,0,2350,2278,2216,2151,2087,2028,1968,1914,1663,1128,1097,1064,1034,1001,976,947,918,893,866,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,NaN
81TruePRIMARYAuto Loan (Personal)NBF2017-07-31IndividualClosed2013-06-112017-07-01 00:00:00NaTNaN4,00,000NaN0NaN00.0NaN20170731,20170601,20170531,20170430,20170331,20170228,20170131,20161231,20161130,20161031,20160930,20160831,20160731,20160630,20160531,20160430,20160331,20160229,20160131,20151231,20151130,20151031,20150930,20150831,20150731,20150630,20150531,20150430,20150331,20150228,20150131,20141231,20141130,20141031,20140930,20140831,000DDD0220210220190220220000270000000000000000210220200220220210220210220220210220210220190220220210220210220,,0,10487,20862,31126,41281,51328,51328,71101,71101,80830,90455,99978,109398,127940,137062,146088,155017,163851,172591,181237,189791,198254,206627,214911,223106,231214,239235,247171,255022,262790,270475,278077,285599,293040,0,,1722,1722,1722,1722,1722,1722,0,1625,0,0,0,0,0,1364,1321,1286,1249,1213,1179,1144,1113,1081,1048,1019,988,961,930,906,880,854,830,806,783,760,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,NaN
92FalsePRIMARYOverdraftPRB2017-03-31IndividualClosed2016-01-152017-03-27 00:00:002017-03-27NaN17,00,000NaN0NaN00.0NaN20170331,20170201,20170101,20161231,20161101,20161001,20160930,20160801,20160701,20160630,20160501,20160401,20160331,000DDDDDD000DDDDDD000DDDDDD000DDDDDD0000,,,1699997,,,154997,,,-3,,,149997,0,,,0,,,0,,,0,,,0,,,,,,,,,,,,,,NaN

Last rows

IDSELF-INDICATORMATCH-TYPEACCT-TYPECONTRIBUTOR-TYPEDATE-REPORTEDOWNERSHIP-INDACCOUNT-STATUSDISBURSED-DTCLOSE-DTLAST-PAYMENT-DATECREDIT-LIMIT/SANC AMTDISBURSED-AMT/HIGH CREDITINSTALLMENT-AMTCURRENT-BALINSTALLMENT-FREQUENCYOVERDUE-AMTWRITE-OFF-AMTASSET_CLASSREPORTED DATE - HISTDPD - HISTCUR BAL - HISTAMT OVERDUE - HISTAMT PAID - HISTTENURE
560834143389FalsePRIMARYTwo-Wheeler LoanNBF2018-09-01IndividualClosed2016-06-302018-07-03 00:00:002018-07-03NaN58,000NaN0NaNNaN0.0NaN20180701,20180602,20180501,20180401,20180302,20180202,20180104,20171201,20171101,20171001,20170901,20170801,20170703,20170601,20170501,20170403,20170301,20170202,20170103,20161202,20161104,20161005,20160905,20160803,20160705,DDD000DDD000000000000000000000000000000000DDD000DDD000000000000000000000000,,5984,,11968,14960,17952,17952,23936,26928,29920,32912,35904,35904,41888,,44880,,53856,53856,59840,59840,62832,65824,68816,71808,,,0,,0,0,0,0,0,0,0,0,0,0,0,,0,,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,,,,,,,,,NaN
560835143390TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2018-09-21NaNNaTNaN2,65,601NaN71,057NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,00000000000000000000000000000000000000000000000000071057,71057,136800,136800,136800,136800,136800,136800,197627,197627,197627,197627,197627,197627,265601,265601,265601,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,NaN
560836143390FalsePRIMARYProperty LoanPRB2019-12-31IndividualActive2019-08-31NaN2019-12-09NaN18,22,000NaN17,96,573NaN00.0NaN20191231,20191130,20191031,20190930,0000000000001796573,1805126,1813602,1822000,0,0,0,0,,,,,121.0
560837143391TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2018-09-22NaNNaTNaN2,75,630NaN73,890NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180930,00000000000000000000000000000000000000000000000000073890,73890,142050,142050,142050,142050,142050,142050,204926,204926,204926,204926,204926,204926,275630,275630,275630,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,NaN
560838143393TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2018-11-23NaNNaTNaN3,0,733NaN1,42,446NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,0142446,155709,168804,181731,194494,207093,207093,231812,243935,255903,267719,279384,290900,300733,300733,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,,NaN
560839143393FalsePRIMARYTractor LoanNBF2019-12-31IndividualClosedNaTNaNNaTNaN2,50,000NaN0NaN00.0Standard20171231,20171130,20171001,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,20170131,000000DDD0540240240000000000000000000,0,,67747,89638,111197,109562,130470,151135,171412,191662,211323,0,0,,22920,22920,22920,0,0,0,0,0,0,,,,,,,,,,,,,12.0
560840143393FalsePRIMARYTractor LoanNBF2019-12-31IndividualActive2017-10-31NaNNaTNaN3,0,000NaN0NaN00.0Standard20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190301,20190228,20190131,20181201,20181130,20181031,20180930,20180801,20180701,20180630,20180501,20180430,20180331,20180228,20180131,20171231,20171130,000000000000000000027028028DDD026030DDD000000000DDDDDD000DDD0000000000000000000,0,14208,28265,42149,55874,69591,83001,96293,,122484,135286,,160108,172497,184813,,,220795,,244136,255560,267150,278310,289342,300000,0,0,0,0,0,0,125,125,125,,125,125,,0,0,0,,,0,,0,0,0,0,0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,24.0
560841143393FalsePRIMARYAuto Loan (Personal)NBF2020-01-31GuarantorActive2016-11-28NaN2020-01-16NaN3,93,819NaN1,1,687NaNNaN0.0Standard20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,20181130,20181031,20180901,20180831,20180731,20180630,20180531,20180430,20180331,20180228,20180131,20171231,20171130,20171031,20170930,20170831,20170731,20170630,20170531,20170430,20170331,20170228,XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX031XXXXXX031DDD031061030XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX101687,111161,120514,129749,138867,147870,156759,165535,174199,182754,191201,199540,207774,215904,223931,231855,,247405,255033,262564,269999,277341,284589,291745,298811,305787,312675,319476,326190,332820,339365,345828,352208,358508,364728,370869,,,,,,,,,,,,,10897,,,10763,,10771,21785,10885,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,48.0
560842143394TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2018-12-20NaNNaTNaN2,50,643NaN1,32,487NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,0132487,132487,132487,191426,191426,191426,191426,191426,191426,250643,250643,250643,250643,250643,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,NaN
560843143395TruePRIMARYTractor LoanNBF2020-01-31IndividualActive2018-12-31NaNNaTNaN2,0,428NaN1,5,499NaN00.0NaN20200131,20191231,20191130,20191031,20190930,20190831,20190731,20190630,20190531,20190430,20190331,20190228,20190131,20181231,0105499,105499,105499,152431,152431,152431,152431,152431,152431,200428,200428,200428,200428,200428,0,0,0,0,0,0,0,0,0,0,0,0,0,0,,,,,,,,,,,,,,,NaN